Efficient pooling designs for library screening.

Authors

  • W J Bruno
  • E Knill
  • D J Balding
  • D C Bruce
  • N A Doggett
  • W W Sawhill
  • R L Stallings
  • C C Whittaker
  • D C Torney
Abstract

We describe efficient methods for screening clone libraries, based on pooling schemes that we call "random k-sets designs." In these designs, the pools in which any clone occurs are equally likely to be any possible selection of k from the v pools. The values of k and v can be chosen to optimize desirable properties. Random k-sets designs have substantial advantages over alternative pooling schemes: they are efficient, flexible, and easy to specify, require fewer pools, and have error-correcting and error-detecting capabilities. In addition, screening can often be achieved in only one pass, thus facilitating automation. For design comparison, we assume a binomial distribution for the number of "positive" clones, with parameters n, the number of clones, and c, the coverage. We propose the expected number of resolved positive clones--clones that are definitely positive based upon the pool assays--as a criterion for the efficiency of a pooling design. We determine the value of k that is optimal, with respect to this criterion, as a function of v, n, and c. We also describe superior k-sets designs called k-sets packing designs. As an illustration, we discuss a robotically implemented design for a 2.5-fold-coverage, human chromosome 16 YAC library of n = 1298 clones. We also estimate the probability that each clone is positive, given the pool-assay data and a model for experimental errors.
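The construction and the efficiency criterion described in the abstract can be illustrated with a small simulation. This is only a sketch under stated assumptions, not the authors' implementation: the function names are hypothetical, assays are treated as error-free, each clone is assumed to be positive independently with probability c/n (one reading of the binomial model with parameters n and c), and the value v = 40 in the usage loop is an arbitrary illustrative choice rather than the value used for the chromosome 16 library. A clone is counted as resolved when all of its pools are positive and at least one of those pools contains no other clone that could still be positive.

```python
import random


def random_k_sets_design(n, v, k, rng):
    """Give each of n clones k distinct pools drawn uniformly from v pools."""
    return [frozenset(rng.sample(range(v), k)) for _ in range(n)]


def assay(design, positives):
    """Error-free assay (assumption): a pool is positive iff it holds a positive clone."""
    positive_pools = set()
    for clone in positives:
        positive_pools |= design[clone]
    return positive_pools


def resolved_positives(design, positive_pools, v):
    """Clones that must be positive, assuming no assay errors."""
    # A clone with any negative pool is definitely negative; the rest are candidates.
    candidates = [i for i, pools in enumerate(design) if pools <= positive_pools]
    per_pool = [0] * v
    for i in candidates:
        for p in design[i]:
            per_pool[p] += 1
    # A candidate that is alone in some positive pool is the only explanation for it.
    return {i for i in candidates if any(per_pool[p] == 1 for p in design[i])}


def mean_resolved(n, v, k, c, trials, seed=0):
    """Monte Carlo estimate of the expected number of resolved positive clones."""
    rng = random.Random(seed)
    total = 0
    for _ in range(trials):
        design = random_k_sets_design(n, v, k, rng)
        # Assumed model: each clone is positive independently with probability c / n.
        positives = {i for i in range(n) if rng.random() < c / n}
        total += len(resolved_positives(design, assay(design, positives), v))
    return total / trials


if __name__ == "__main__":
    # n and c follow the abstract's example; v = 40 is an arbitrary illustrative value.
    for k in range(1, 9):
        print(k, mean_resolved(n=1298, v=40, k=k, c=2.5, trials=200))
```

Scanning k for fixed v, n, and c in this way gives a rough numerical counterpart to the optimal k that the abstract says is determined as a function of those parameters.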


Related articles

New Constructions of One- and Two-Stage Pooling Designs

The study of gene functions requires a high-quality DNA library, which is obtained through a large amount of testing and screening. Pooling design is a very helpful tool for reducing the number of tests for DNA library screening. In this paper, we present new one- and two-stage pooling designs, together with new probabilistic pooling designs. The approach in this paper works for both er...


Statistical Design and Analysis of High Throughput Screening Data Using Pooling Experiments and Data Mining Techniques

REMLINGER, KATJA S. Statistical Design and Analysis of High Throughput Screening Data Using Pooling Experiments and Data Mining Techniques. (Under the direction of Dr. Jacqueline M. Hughes-Oliver and Dr. S. Stanley Young.) Discovery of a new drug involves screening large chemical libraries to identify new and diverse active compounds. Only a very small percentage of the compounds in the library...


YAC Library Pooling Scheme for PCR-Based Screening

The PCR is a rapid method for screening a library of clones for the presence of clones containing an STS. Usually the library is divided into pools of clones, and the PCR is run on each pool. The problem we address here is to design efficient and robust pooling schemes for such PCR-based screening. Two questions are relevant: (1) Given an arbitrary unique sequence, how should one pool a library...


Statistical Design of Pools Using Optimal Coverage and Minimal Collision

Discovery of a new drug involves screening large chemical libraries to identify active compounds. Screening efficiency can be improved by testing compounds in pools. We consider two criteria to design pools: optimal coverage of the chemical space and minimal collision between compounds. Five pooling designs are applied to a public data set. We evaluate each method by determining how well the de...


Statistical Design of Pools Using Optimal Coverage and Minimal Collision -- Institute of Statistics Mimeo Series 2549

Discovery of a new drug involves screening large chemical libraries to identify active compounds. Screening efficiency can be improved by testing compounds in pools. We consider two criteria to design pools: optimal coverage of the chemical space and minimal collision between compounds. Five pooling designs are applied to a public data set. We evaluate each method by determining how well the de...



Journal:
  • Genomics

Volume 26, Issue 1


Publication date: 1995